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Abstract 

This paper considers the asymptotic power of likehhood ratio test 
(LRT) for the identity test when the dimension p is large compared to 
the sample size n. The asymptotic distribution of LRT under alterna- 
tives is given and an explicit expression of the power is derived when 
tr(Sp) — log |Ep| — p tends to a constant. A simulation study is carried 
out to compare LRT with other tests. All these studies show that LRT is 
a powerful test to detect eigenvalues around zero. 
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1 Introduction 



In multivariate analysis for high dimensional data, testing the structure of 
population covar ia nce matrices is an imp o rtant probl e m. S e e, for e x ample , 



Johnstond (l200lh . 



Chen et al\mm 



Ledoit and W ol? ("200^, ISrivastaval (l2Q05l). jSchotd (|2006l) . 



Cai and Jiana (,201 if ) and lLi and ChenI (|2012[) . among many 



others. Let Xi, • • • ,Xn be n independent and identically distributed (i.i.d.) p- 
variate random vectors following a multivariate normal distribution Np{iJL, Yip) 
where ^ is the mean vector and Sp is the covariance matrix. In many studies, 
a hypothesis test of significant interest is to test 
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where Ip is the p-dimensional identity matrix. Note that the identity matrix in 
([l} can be replaced by any other positive definite matrix Sq through muhiplying 
the data by 

A natural approach to test ^ is to conduct estimations for some distance 
measures between Sp and Ip and there are two types of measures which are 
widely used in literature. The first is based on the likelihood function: 

LKSp)=tr(Sp)-log|I]p|-p, 

and the second is based on quadratic loss function: 

=tr(Ep-Ip)2. 

Each of these is when Ep — Ip and positive when Sp 7^ Ip. For Li(T,p), it is 
referred to the likeliho od ratio test (LR T) and the classic LRT for fixed p and 
large n can be found in AndersonI ()2003l ). For high dimensional data (p is large), 
the failure of class ical LRT was firs tly observed bv lDempstei ( 19581 ) and later in 
a pioneer work bv lBai et all (|2009t ). authors proposed corrections t o LRT when 
p/n — >• c S (0, 1) and /z = 0. Succe s sive w orks included I Jiang et all (|2012l ) which 
extended the results of Bai et al. ( 2009f ) to Gaussian data with general /z and 
our work lWang et al. ( 2012 ) where we studied the LRT for general fj, and non- 
Gaussian data. For the quadratic los s function £o(Ep ), there are many work s 
since the semina l pape r iNagaol (Il973l). Th e se includedlLedoit and Wolj (|2002|) 
Birke and Dettd (|20Q5l ). ISrivastaval (|2005l) . IChen et al\ (|2010t ) and lCai and Ma 
( 2012 ) . Other works which consi dered question (HI) a re referred to Ijohnstone 

200lli . lCai and JiangI (|201l[ ) and lOnatski et all (|201 ll) . 

The existing results about LRT (iBai et all . 2009 ; Jiang et al. , 2012 ; Wang et al 



2012f) have only derived asymptotic null distribution and we know little about 
the as ymptotic point- wise p ower of LRT under the alternative hypothesis. Re- 
cently, lOnatski et al. (2011) studied the asymptotic power of several tests in- 
cluding LRT under the special alternative of rank one p e rturb ation to the iden- 
tity matrix as both p and n go to infinity. ICai and Ma (2012) investigated the 
testing problem ([T]) in the high-dimensio nal s ettings from a mini max point of 
view. The results in lOnatski et all ( 201 if ) and Cai and Ma (2012) showed that 
LRT was a sub-optimal test when there was a rank one perturbation to the 
identity matrix . 

In this work, we will consider the power of LRT under general alternatives. 
The asymptotic distribution of LRT will be studied when Ep 7^ Ip and an 
explicit expression of the power will also be derived when L/(Sp) tends to a 
constant. From thes e resu hs, we find that in relation to LRT it is not fair that 
Onatski et all (j201l[ ) and ICai and Mai (|2012| ) only focused on the alternatives 
whose eigenvalues were larger than 1. Furthermore, our results show that LRT is 
powerful to detect eigenvalues around zero. Sim ulations wil l also be conducted 
to co mpare LRT with two tests based on ig(Ep) ()Chen et aLl . l2010l: l ICai and Mai . 
2OI2I) . 

The paper is structured as follows: Section 2 introduces the basic data struc- 
ture and establishes the asymptotic power of LRT while Section 3 reports sim- 
ulation studies. All the proofs are presented in the Appendix. 
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2 Main Results 



To relax the Gaussian ass umptions, we assume that the observations Xi , ■ ■ ■ , X„ 
satisfy a multivariate model ( Chen et a/.l . I2OI0I ) 

X, = ^l/^Y, + fi, fori = l,--- ,n (4) 



where is a p-dimensional constant vector and the entries of 3^„ = (^ijOpxri = 
(Yi, ■■■,¥„) are i.i.d. with EYij = 0, EY^j = 1 and £^1;^ = 3 + A. The sample 
covariance matrix is defined as 



1 " 

Sn = y^iXk - X)iXk ~ X)', 

n — 1 ^ — ' 



fc=i 

where X = ^ELi^fe- 

Writing t/„ = p/n < 1, the LRT statistic is defined as 

i„--tr(S„)-ilog|Sn|-l-d(y,) 
V P 



(5) 



where tr denotes the trace a nd d(x] = 1 + — l)log(l — x), < x < 1. 
Under the null hypothesis, Wang et al. ( 20121 ) derived the following asymptotic 
normality of i„ by using random matrix theories. 

Theorem 1 (Theorem 2.1 of I Wang et al\ (|2012l )) When Ep = Ip andy,, = 

p/v^y^ (0,1): 



7V(0,1), 



where fin = 2/ri(A/2 - 1) - 3/21og(l - y„), cr^ 
denotes convergence in distribution. 



'2yn - 2 log (1 - y„) and 



When Xi,- - - ,Xn be i -i -d. m ultivariate normal distributions Np(fj,,'E,p) 
where A = 0, iJiang et ali (j2012l ) derived a similar result as Theorem [T] by 
using the Selberg integral and they also considered the special situation where 
p/n — > 1. Based on the asymptotic normality under the respective null hypoth- 
esis, an asymptotic level a test based on L„ is given by 



fJ-n 



(6) 



where /(•) is the indicator function, and zi^q denotes the 100 x (1 — a)th 
percentile of the standard normal distribution. In the following theorem, we 
establish the convergence of Ln under the alternative Ep "j^ Ip- 

Theorem 2 When tr(I]p — Ip)^/p and j/„ — p/y y e {0, 1), 



pLn — Lli^p) — fJ-n D 



7V(0,1), 



where fin = yn(A/2 - 1) - 31og(l - y„)/2 and = -2y„ - 2 log (1 - y„ 
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In particular, when L/(Ep) tends to a constant, we have the fohowing resuh. 



Theorem 3 When Li{J^p) — > 6 e (0, oo) and yn = p/y — > y G (0, 1), 

b 



hm Psp((/) rejects Hq) = 1 - 



y/-2y-2\og{l-y) 



where $(•) is the cumulative distribution function of the standard normal dis- 
tribution. 



It can be seen from Theorems [5] and [3] that the expression 



(7) 



gives good approximation to the power of the test in ([6]) until the power is 
extremely close to 1. In particular, when L;(I]p) is large, the power of the test 4> 
will be close to 1 and it is hard for (p to distinguish between the two hypotheses 
if Li(Yip) tends to zero. 

To deriv e the asym ptotic power, a spec ial covariance matrix was used in 
Cai and Mai ()2012l ) and lOnatski et all (|201ll ) as follows 



S* = /p + h^j -vv', 



(8) 



where /i is a co nstant and v is an arbitrarily fixed unit vector. For this special 
spiked matrix (jJohnstond . 2001), the t r ue Sp has a perturbation in a single 



Cai and Mai (|2012[ ) and lOnatski et d\ (|201l[ ). the 



unknown direction and in 
authors focused on situations where h > that is one eigenvalue of Ep is larger 
than 1 while others are still unitary. Here, by Theorem |21 we know that the 
asymptotic power of the LRT test ^ is 



1 - ^(Zl-a 



V-2y-21og(l-y)' 



Therefore, compared with the tests based on Lq{T,p) ( Ledoit and WolA 120021 : 
Chen et a/.l . l2O10HCai and Mal . l2012l ) whose power for S* is 1 - <^{zi-a- h"^ /2), 



LRT is more sensitive to small eigenvalues {h < 0), not any bigger than one 
{h > 0). In particular, when 1 + h^/y is close to that is E* has a very small 
eigenvalue, the power will tend to 1. A numerical experiment will be conducted 
in the next section to show the performances of LRT (j6|) and the tests based on 



Lg(Ep). 



3 Simulations 

In this section, we conduct several simulation studies to compare the power 
of the LRT in ^ with that of the tests based on Lg(Ep). When Xi, ■ ■ ■ ,X„ 
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i.i.d from Np(0. S^V ICai and Mai (j2012l) proposed an estimator of Lq{T,p) as 



2 

Ti,n = —, -T > h{Xi,Xj 

n{n — 1) ^ 



(9) 



i<j<<ji 



where h{Xi,X2) — {X[X2)'^ — (X^Xi + X2X2) + p and an asymptotic level a 
test based on Ti^„ is given by 



/(Ti,„ > zi 



lp{p + i)_ 

n(n — 1) 



(10) 



Similarly, for general data structure, Chen et al. ( 2010[ ) gave an estimator for 

1 * 2 * 



4 E ^'^^^^'k^i " 2tr(5„) 



' pi 

where — n\/{n — r)\ and ^* denotes summation over mutually different 
indices. And the level a test is 



2p 



(11) 



To evaluate the power of the tests, the alternative population covariance matrix 
will be set as S* = diag{p, 1, • • • ,1) where p will range from 0.01 to 4. Here, 
we only focus on this simple matrix to investigate the features of each test and 
simulations for m ore general alternatives can be found in our previous work 
Wang etah ( 2012 ). All the results are based on 10'* rephcations. 



Fig ure [T] reports empirical powers of the LRT and the test of ICai and Ma 
(|2012f) (CM test) for 7Vp(0,Ep) distributions. We observe from Figure [T] that 
LRT had a better performance for p < 1 while the CM test performed better 
if p > 1. When p is around 1 that is the true Ep is very close to an identity 
matrix, both tests had similar empirical sizes which were quite close to the 
nominal 5%. For the fixed sample size n = 200, when p was increased from 50 
to 100, the performances of both tests became poor which could be understood 
as the estimators get worse as p is increased for the fixed sample size. 

For general data, due to the CM test is only applicable for iVc (0,£„), we 
conduct comparisons between LRT and the test of lChen et all I (|20inl ) (CZZ test) 
where Yij comes from Gamma(4, 0.5) distribution in data (HJ. Figure [2] reports 
the empirical powers of the LRT and CZZ tests and the conclusions follow very 
similar patterns to those of Figure [1] which shows that LRT is a powerful test 
to detect eigenvalues around zero. 
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Figure 1: Empirical performances of LRT and CM tests for Gaussian data with 
known means. 
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Figure 2: Empirical performances of LRT and CZZ tests for non-Gaussian data 
with unknown means 
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4 Appendix 

4.1 Proof of Theorem [2] 

Writing 



S« = ^V(>'fc-n(i"fe-?)', (12) 
n — I ^ — ' 



k=l 



where Y — J2k=i ^k- Noting S'„ = Ey^BnE^^, we have 
Ln = itr(S„)-ilog|Sn|-l-d(yn) 

p p 

= itr(SpB„) - itr(Bn) - - log |Ep| + BL„ 
P P P 

= BLn + i^KSp) + ^tr((Sp - Ip)Bn) - ^tr(Sp - Ip), 
where BL„ = itr(B„) - i log |B„| - 1 - d(y„). By Theorem^ 

Therefore, to prove Theorem [21 it is enough to show 

e„ := tr((Ep - Ip)Bn) - tr(Sp - Ip) = Op(l). 
Rewriting i?„ as 

n 

Bn = -Yl YkYl - Y^Y;, (13) 

n ■'^ n{n — 1) '^-^ •' 

k=i ^ ' i^j 

we can get i?[e„] — and by Proposition A.l of Chen et al. ( 2010f ). 

1 " 1 
^[^n] = - Ip)Yk ~ _,. Y.^:i^P - h)Yif\ - (tr(Sp - Ip 

fe=i ^ > i^j 



< 



2 ... A 

^2 



tr((Ep - Ip)2) + -tr((I]p - Ip) o (Sp - Ip)) 
2 + A 



-tr((Ep-Ip)^) 



where o denotes Hadaniard product. Above all, when p/n — y and tr((Ep 
Ip)^)/p — )■ 0, we come to e„ = Op(l). 
The proof is completed. 
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4.2 Proof of Theorem [3] 

To prove the theorem, we need some inequahties. 
Lemma 1 For any x > 0, 

(1) IfO<x< I, (x- 1)2 < 2(a; - 1 - logx); 

(2) Ifl<x< M, {x ~ If < 2M{x - 1 - log x). 

Since — > 6 G (0, oo), for large enough p, we will always have Di{Y,p) < 
2b. Denoting the eigenvalues of Ep as Ai.p, • • • , Ap_p, 

p 

L,(I]p) = ^(Afc,p - logAfc,p - 1) < 26, 
fc=i 

which implies that there is a constant cq = co(6) > 1 satisfying 

Cq ^ ^ '^i.pj ■ ' ' 1 '^p,p — "^o- 
By Lemma[Il (Afc,p — l)^ < 2co(Afc,p — logAfc,p — 1) which means 
p 

Xg(Ep) = ^(Afe,p - if < 2caLi{Ep) < 4co6 = o{p). 

k=l 

By Theorem [2] and Slutsky's lemma, 

P^^^^^ ^ 4 N{0, 1). 

<Tn V-2y-21og(l-y) 

Now, we can calculate the power of the test 

Ps„{'t> rejects i?o) = P{^^ — — > 

^ p,pLn_[hi_ h b 

a„ v/-2y-21og(l-y) V-22/-21og(l-2/) 

V-22/-21og(l-2/)' 
The proof is completed. 
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